Data Mining: A Preprocessing Engine

نویسندگان
چکیده

منابع مشابه

Data Mining: A Preprocessing Engine

This study is emphasized on different types of normalization. Each of which was tested against the ID3 methodology using the HSV data set. Number of leaf nodes, accuracy and tree growing time are three factors that were taken into account. Comparisons between different learning methods were accomplished as they were applied to each normalization method. A new matrix was designed to check for th...

متن کامل

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

MIDCA --- A Discretization Model for Data Preprocessing in Data Mining

Decision tree is one of the most widely used and practical methods in data mining and machine learning discipline. However, many discretization algorithms developed in this field focus on univariate only, which is inadequate to handle the critical problems especially owned by medical domain. In this paper, we propose a new multivariate discretization method called Multivariate Interdependent Di...

متن کامل

A Framework for Trajectory Data Preprocessing for Data Mining

Trajectory data play a fundamental role to an increasing number of applications, such as traffic control, transportation management, animal migration, and tourism. These data are normally available as sample points. However, for many applications, meaningful patterns cannot be extracted from sample points without considering the background geographic information. In this paper we present a fram...

متن کامل

DB-HReduction: A data preprocessing algorithm for data mining applications

Data preprocessing is an important and critical step in the data mining process and it has a huge impact on the success of a data mining project. In this paper, we present an algorithm DBHReduction, which discretizes or eliminates numeric attributes and generalizes or eliminates symbolic attributes very efficiently and effectively. This algorithm greatly decreases the number of attributes and t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Computer Science

سال: 2006

ISSN: 1549-3636

DOI: 10.3844/jcssp.2006.735.739